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Abstract 

Disjunctive Logic Programming (DLP) is a very expressive formalism: it allows for expressing every 
property of finite structures that is decidable in the complexity class Ef (=NP^^). Despite this high 
expressiveness, there are some simple properties, often arising in real-world applications, which 
cannot be encoded in a simple and natural manner. Especially properties that require the use of 
arithmetic operators (like sum, times, or count) on a set or multiset of elements, which satisfy some 
conditions, cannot be naturally expressed in classic DLP. 

To overcome this deficiency, we extend DLP by aggregate functions in a conservative way. In 
particular, we avoid the introduction of constructs with disputed semantics, by requiring aggregates 
to be stratified. We formally define the semantics of the extended language (called DLP''^), and 
illustrate how it can be profitably used for representing knowledge. Furthermore, we analyze the 
computational complexity of DLP'^, showing that the addition of aggregates does not bring a higher 
cost in that respect. Finally, we provide an implementation of DLP"^ in DLV- a state-of-the-art DLP 
system - and report on experiments which confirm the usefulness of the proposed extension also for 
the efficiency of computation. 

KEYWORDS: Disjunctive Logic Programming, Answer Set Programming, Aggregates, Knowledge 
Representation, Implementation 



1 Introduction 

Disjunctive Logic Programs (DLP) are logic programs where (non-monotonic) negation 
may occur in the bodies, and disjunction may occur in the heads of rules (.Minker 1982I ). 
This language is very expressive in a precise mathematical sense: under the answer set se- 
mantics dGelfond andLifschitz 199 ft it allows to express every property of finite structures 
that is decidable in the complexity class (=NP^^) ( |Eiter, Gottlob, and Mannila 19971 . 
Therefore, under widely believed assumptions, DLP is strictly more expressive than nor- 
mal (disjunction-free) logic programming, whose expressiveness is limited to properties 
decidable in NP, and it can express problems which cannot be translated to satisfiability of 
CNF formulas in polynomial time. Importantly, besides enlarging the class of applications 



* A preliminary version of this work appeared in the Proceedings of IJCAI-03. 
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which can be encoded in the language, disjunction often allows for representing problems 
of lower complexity in a simpler and arguably more natural fashion, cf. (lEiter et al. 2000l l. 

The problem. Despite this high expressiveness there are some simple properties, often 
arising in real-world applications, which cannot be encoded in DLP in a simple and natural 
manner Among these are properties which require the application of arithmetic operators 
such as count, sum, or min on a set of elements satisfying some conditions. 

Suppose, for instance, that you want to know if the sum of the salaries of the employ- 
ees working in a team exceeds a given budget (see Team Building in Section [3]i. Using 
standard DLP, one first has to define an order over the employees, yielding a successor 
relation. Then, one has to define a sum predicate in a recursive way using this successor 
relation, computing the sum of all salaries, and compare its result with the given budget. 
This approach has two drawbacks: (1) It is bad from the KR perspective, as the encoding 
is not immediate and not natural at all. In particular, an ordering or successor relation of- 
ten is not available and has to be provided in an explicit manner (2) It is inefficient, as the 
(instantiation of the) program is quadratic (in the cardinality of the input set of employees). 

Thus, there is a clear need to enrich DLP with suitable constructs for the natural repre- 
sentation of such properties and to provide means for an efficient evaluation. 

Contribution. We overcome the outlined deficiency of DLP. Instead of inventing new 
constructs from scratch, as in some approaches in the literature (e.g., jSimons et al. 2002b ). 
we extend the language with aggregate functions, like those studied in the context of 
databases, and implement them in DLV (I Leone et al. 2006l l - a state-of-the-art Disjunc- 
tive Logic Programming system. The main advantages of this approach are that extensibil- 
ity of the language (both syntactically and semantically) is straightforward, that aggregate 
functions are widely used, for instance in database query languages, and that many issues 
arising from the use of aggregates are well-understood. 
The main contributions of this paper are the following: 

• We extend Disjunctive Logic Programming by aggregate functions and formally de- 
fine the semantics of the resulting language, named DLP-^. Actually, we introduce 
aggregates in the full DLV language, that is, DLP"^ includes also weak constraints 
dBuccafurri et al. 2000] i. 

• We address knowledge representation issues, showing the impact of the new con- 
structs and describe ways how they can be employed profitably on relevant prob- 
lems. We also highlight the usefulness of assignment aggregates, a new feature of 
DLP-^, which is not supported by other ASP systems with aggregates. 

• We analyze the computational complexity of DLP-^. We consider DLP-^ programs 
with and without weak constraints. Importantly, it turns out that in both cases the 
addition of (stratified) aggregates does not increase the computational complexity, 
which remains the same as for reasoning on aggregate-free programs. 

• We provide an implementation of DLP-^ in the DLV system, deriving new algo- 
rithms and optimization techniques for efficient evaluation. 

• We report on experimentation, evaluating the impact of the proposed language ex- 
tension on efficiency. The experiments confirm that, besides providing relevant ad- 
vantages from the knowledge representation point of view, aggregate functions can 
bring significant computational gains. 
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• We compare DLP-^ with related work proposed in the Hterature. 

The resuh of this work is a concrete and powerful tool for knowledge representation and 
reasoning, enhancing the modeling features of standard DLP and Answer Set Programming 
(ASP) systems. 

DLP-^, as described in this article, requires aggregates to be stratified, that is, predicates 
defined by means of aggregates are not allowed to mutually depend on each other The 
reason is that the set of stratified aggregate programs is the largest class on which all major 
semantics proposed in the literature coincide. Moreover, the introduction of unstratified 
aggregates causes a computational overhead in some cases, while the computational com- 
plexity of the reasoning tasks remains the same if stratified aggregates are introduced. (See 
Section lTTI for a discussion about this issue.) 

It is worthwhile noting that, compared with other implementations of aggregates in DLP 
and ASP, the language of our system supports some extra features which turn out to be 
very useful in practice for KR applications. For instance, the Fastfood problem, described 
in Section[3l is represented naturally and compactly in our language, while its encoding in 
the language of other DLP and ASP systems seems to be more involved causing compu- 
tation to be dramatically less efficient, due to their more severe safety restrictions (domain 
predicates), and also to the lack of the "min" aggregate function (see Section lT2b . 

The paper is organized as follows. Section |2] illustrates the DLP-^ language, providing 
a formal specification of both the syntax and the semantics of our extension of DLP with 
aggregates. Section [3] addresses knowledge representation issues, showing the profitable 
employment of aggregate functions in a couple of examples. Section |4] analyzes the com- 
putational complexity of the DLP-^ language. Section |5] addresses some implementation 
issues. Section|6]reports on the results of the experimentation activity. Section|7]discusses 
related works. Finally, in Section[8]we draw our conclusions. 

2 The DLP-^ Language 

In this section we provide a formal definition of the syntax and semantics of the DLP-^ 
language. DLP-^ is an extension of the language of the DLV system by set-oriented (or 
aggregate) functions. Specifically, DLP-^ includes disjunction, default (or non-monotonic) 
negation, integrity and weak constraints, and aggregates0For further background we refer 
to jGelfond and Lifschitz 19911) . jBaral 2003l l. and jLeone et al. 2006l l. 

2.1 Syntax 

We assume sets of variables, constants, and predicates to be given. Similar to Prolog, we 
assume variables to be strings starting with uppercase letters and constants to be non- 
negative integers or strings starting with lowercase letters. Predicates are strings starting 
with lowercase letters or symbols such as —, <, > (so called built-in predicates that have 
a fixed meaning). An arity (non-negative integer) is associated with each predicate. 

^ We do not treat strong negation explicitly. DLV supports this by a simple rewriting technique, adding a con- 
straint :- a, -a for each strongly negated atom -a, where a also occurs in the program. 
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Standard Atoms and Literals. A term is either a variable or a constant. A standard atom 
is an expression p{ti, . . .,t„), where p is a predicate of arity n and ti,. . . ,t„ are terms. A 
standard literal L is either a standard atom A (in this case, it is positive) or a standard atom 
A preceded by the defauh negation symbol not (in this case, it is negative). A conjunction 
of standard literals is of the form Li, . . . , where each L,- (1 < i < k) is a standard literal. 

A structure (e.g. standard atom, standard literal, conjunction) is ground, if neither the 
structure itself nor any substructures contain any variables. 

Sets. A (DLP-^) set is either a symbolic set or a ground set. A symbolic set is a pair 
{Vars : Conj}, where Vars is a comma-separated list of variables and Conj is a conjunc- 
tion of standard literals. Intuitively, a symbolic set {Xxi{X, Y), not p{Y)} stands for the set 
of Z- values making the conjunction a (X, Y), not p{Y) true, i.e., {X -.BY such that a{X, Y)A 
not p{Y) is true}; see Section l23] for details. 

A ground set is a set of pairs of the form (7 : Conj), where f is a list of constants and 
Conj is a ground conjunction of standard literals. 

Aggregate Functions and Aggregate Atoms. An aggregate function is of the form 
f{S), where 5 is a set, and/ is a function name among ^count, #min, ^max, ^sum, 
#times. An aggregate atom is 

where/(5') is an aggregate function, -(i, -<2£ {=, <, <7 >, >}, and Lg and Rg (called 
left guard, and right guard, respectively) are terms. One of "Lg -<i" and "^2 Rg" can be 
omitted. In this case, "0 <" and "< +00" are assumed, respectively. If both -<i, -<2 are 
present, we assume for simpUcity that ^iG {<, <} if and only if {<, <} and that 
both ^1 and -<2 are different from =H 

Example 1 

The following are two aggregate atoms. The latter contains a ground set and could be a 
ground instance of the former 

#max{Z : r{Z),a{Z,V)} > Y 

#max{(2 : r{2),a{2,x)), (2 : r(2), fl(2,y))} > 1 

(General) Atoms, Literals and Rules. An atom is either a standard atom or an aggregate 
atom. 

A literal L is an atom A (positive literal) or an atom A preceded by the default negation 
symbol not (negative literal). If A is an aggregate atom, L is an aggregate literal. 
A (DLP-^) rule r is a construct 

ai V ■ ■ ■ y a„ :- bi, . . . ,bk,bk+i, . . .,b,„- 

where ai, . . . ,a„ are standard atoms, bi, . . . ,bk are positive literals, and bk+i, . . . ,b,„ are 
negative literals, and n > 0,m > k > 0, m+n > 1. The disjunction oi V • • • V a„ is the head 
of r, while the conjunction bi, . . . , b,„ is the body of r, bi, . . . ,bk being the positive body 



^ The aggregates not considered are of limited importance, as they impose two upper or two lower guards, of 
which one will be redundant. 
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and bk+i, ■ ■ ■ ,b,„ the negative body. We define //(r) — {ai, . . . ,a„}, B{r) — {bi, . . . ,bm}, 
B^{r) = {bi, . . . ,bk}, and B^{r) = {bk+i, ■ ■ ■ ,bm}. A rule without head literals (i.e. 
« = 0) is usually referred to as an integrity constraint. A rule with an empty body (i.e. 
m = 0) is called a fact, and we usually omit the " :- " sign in this case. 

Weak Constraints. The language of DLV, that we enhance by aggregates in this paper, 
extends disjunctive Catalog by another construct called weak constraint dBuccafurri et al. 2000l l. 
The DLP-^ language allows for a general form of weak constraints also including aggre- 
gate literals. 

We define weak constraints as a variant of integrity constraints. In order to differentiate 
between these two, weak constraints use the symbol instead of In addition, a 
weight and a priority level inducing a partial order among weak constraints are specified. 

Formally, a weak constraint wc is an expression of the form 

bi, . . .,bk,bk+i, ...,b„- [w: I] 

where bi, . . . ,bk are positive literals, bk+i, . . . ,b,n are negative literals, and w (the weight) 
and I (the JeveJ, or layer) are positive integer constants or variables. For convenience, w, /, 
or both can be omitted and default to 1 in this case. 

DLP-^ Programs. A (DLP-^) program V {program, for short) is a set of DLP-^ rules (pos- 
sibly including integrity constraints) and weak constraints. For a program V, let RulesiV) 
denote the set of rules (including integrity constraints), and let WC{V) denote the set of 
weak constraints in V. A program is positive if it does not contain any negative literal. 



2.2 Syntactic Restrictions and Notation 

We begin with two notions of stratification, which make use of the concept of a level 
mapping. Functions 1 1 1 1 from predicates in a program P to finite ordinals are called JeveJ 
mappings of P. 

Negation-stratification. 

A program P is called negation-stratified (Apt et al. 1988 Przymusinski 19881 ), if there 



is a level mapping 1 1 1 1„ of 7^ such that, for each pair p and p' of predicates of P and every 
rule r of P, 

1. if p occurs in B^{r) and p' occurs in H{r), then | |p| |„ < | |p'||„; and 

2. if p occurs in B^{r) and p' occurs in H{r), then \ \p\\„ < \ \p'\\n', and 

3. if and 79' occur in //(r), then 1 1/? 1 1„ = ||/?'||n. 

Aggregate-stratification. 

The idea of aggregate-stratification is that two predicates defined by means of aggregates 
do not mutually depend on one another. 

A DLP-^ program P is aggregate-stratified if there exists a level mapping 1 1 | |a such that 
for each pairp and p' of predicates of P, and for each rule r eP, 

1. if p occurs in a standard atom in B(r) and/?' occurs in //(r), then I |;7||a < ||p'||fl;and 

2. if p occurs in an aggregate atom in Z?(r), and /:>' occurs in //(r), then \ \p\\a < Hp' Ho; 
and 

3. if p and p' occur in //(r), then ||p||fl — \ \p'\\a- 
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Example 2 

Consider a program consisting of a set of facts for predicates a and b, plus the following 
two rules: 

q{X) :-/?(X),#count{y : a{Y,X),b{X)] < 2- 

p{X) ■.-q{x),bixy 

The program is aggregate-stratified, as the level mapping ||fl|| = \\b\\ = 1 \\p\\ ~ 
\\q\\ — 2 satisfies the required conditions. If we add the rule b{X) :-p{X), no such level 
mapping exists and the program becomes aggregate-unstratified, as in this case a level 
mapping would have to satisfy ||^|| > \ \b\ \ > \ \p\ \ > ||^||, hence ||^|| > ||^||. ■ 

Intuitively, aggregate-stratification forbids recursion through aggregates. It guarantees 
that the semantics of aggregates is agreed upon and coherent with the intuition, while the 
semantics of aggregate-unstratified programs is debatable, and some semantic properties 
(like, e.g., existence of answer sets for positive programs) are usually lost. For a more 
detailed discussion, see Section lTTI 

Local and global variables, Safety. For simplicity, and without loss of generality, we 
assume that the body of each rule and weak constraint contains at most one aggregate 
atomJl A local variable of a rule r is a variable appearing solely in an aggregate function in 
r; a variable of r which is not local is called global. A nested atom of r is an atom appearing 
in an aggregate atom of r; an atom of r which is not nested is called unnested. 

A rule or weak constraint r is safe if the following conditions hold: (i) each global 
variable of r appears in a positive unnested standard literal of the body of r; (ii) each local 
variable of r that appears in a symbolic set {Vars : Conj} also appears in a positive literal 
in Conj. Finally, a program is safe if all of its rules and weak constraints are safe! 

Condition (i) is the standard safety condition adopted in Datalog, to guarantee that the 
variables are range restricted dUlknan 19891) . while condition (ii) is specific to aggregates. 

Example 3 

Consider the following rules: 

p{X) -.-qiX, Y,V),Y< #max{Z : r{Z),not a{Z,V)}- 
p{X) :-^(X, Y,V),Y< #sum{Z : nota{Z,S)}- 
p{X) :-^(X, Y,V),T< #min{Z : r{Z),not a{Z,V)}- 

The first rule is safe, while the second is not, since both local variables Z and S violate 
condition (ii). The third rule is not safe either, since the global variable T violates condition 
(i). ■ 

We assume in the following that DLP-^ programs are safe and aggregate-stratified, un- 
less explicitly stated otherwise. 



Note that we do this only to simplify the definitions; our implementation can deal with multiple aggregates in 
one rule. 

^ Note that the safety restrictions apply also to aggregate-free rules and constraints. 
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2.3 Semantics 

Let us first define some notation which is to be used subsequently. Given a DLP-^ program 
V, let U-p denote the set of constants appearing in V, U-^ C U-p the set of the natural num- 
bers occurring in U-p, and Bp the set of standard atoms constructible from the (standard) 
predicates of V with constants in Up>. Given a set X, let 2 denote the set of all multisets 
over elements from X. 

Let us next describe the domains and the meanings of the aggregate functions considered 
in this work: 

#count: defined over 2 , the number of elements in the set. 

#suin: defined over 2 ^, the sum of the numbers in the set; in case of the empty 
set. 

#times: over 2 ^ the product of the numbers in the set; 1 for the empty set. 
#inin, #max: defined over 2 — {0}, the minimum/maximum element in the seto 

If the argument of an aggregate function does not belong to its domain, the aggregate 
evaluates to false (denoted as _L). 

Instantiation. A substitution is a mapping from a set of variables to the set Up of the 
constants in V. A substitution from the set of global variables of a rule r (to U-p) is a global 
substitution forr, a substitution from the set of local variables of a symbolic set S (to U-p) 
is a local substitution for S. Given a symbolic set without global variables S — {Vars : 
Conj}, the instantiation of S is the following ground set of pairs inst{S): {{'-f{Vars) : 
^{Conj)) I 7 is a local substitution foiS}^ 

A ground instance of a rule or a weak constraint r is obtained in two steps: (1) a global 
substitution a for r is applied to r; and (2) every symbolic set S in cr(r) is then replaced 
by its instantiation inst{S). The instantiation Ground{V) of a program V is the set of all 
possible instances of the rules and the weak constraints of V. 

Example 4 

Consider the following program Vi : 

q{l)yp{2,2)- q{2)\/p{2,l)- 
t{X) ■.-q{X),#sum{Y: p{X,Y)} > 1- 

The instantiation Ground{Vi) is the following: 

q{l)yp{2,2)- q{2)yp{2,l)- 
f(l):-^(l),#sum{(l :p(l,l)),(2:p(l,2))}>l• 
f(2):-<?(2),#sum{(l :p(2,l)>,(2 :/.(2,2)>} > 1- 



^ Note that #min and #max can be easily extended to the domain of the strings by considering the lexicographic 
ordering. 

^ Given a substitution cr and a DLP-^ object O (rule, conjunction, set, etc.), we denote by cr{0) the object 
obtained by replacing each variable Z in O by 



8 



W. Faber et al. 



For any program P, Ground{V) denotes the set GroundRules{V) U GmimdWC(V), 
where GroundRules{V) — Grounder) and GroundWC{V) — (J Ground{w). 

r£Rules{V) w&WC{V) 

Note that for prepositional programs, V = Ground{V) holds. 

Interpretations and Models. An interpretation of a DLP-^ program "P is a set of standard 
ground atoms / C B-p. The truth valuation /(A), where A is a standard ground literal 
or a standard ground conjunction, is defined in the usual way. Besides assigning truth- 
values to standard ground literals, an interpretation provides meaning also to ground sets, 
aggregate functions and aggregate literals; the meaning of a set, an aggregate function, 
and an aggregate atom under an interpretation, is a multiset, a value, and a truth-value, 
respectively. Let f{S) be a a ground aggregate function. The valuation of the (ground) set S 
w.r.t. / is the multiset 7(5) defined as follows: Let 5/ = {(fi,- • •,?«} | (fi,- • ■,t„:Conj) € 
S A Conj is true w.rt. /}, then 7(5') is the multiset obtained as the projection of the tuples 
of Si on their first constant, that is I{S) = [fi | (fi, ■ • ■, f„) G 5/]. 

The valuation I(f{S)) of a ground aggregate function /(5') w.r.t. I is the result of the 
application off on I{S). If the multiset 7(5') is not in the domain of/, I(f{S)) = _L (where 
_L is a fixed symbol not occurring in P). 

A ground aggregate atom A = Lg ^if{S) ■<2 Rg is true w.r.t. 7 if: (i) I{f{S)) ^ _L, and 
(ii) the relationships Lg I{f{S)) and I{f{S)) <2 Rg holcS otherwise, A is false. 

Example 5 

Let7bethe interpretation {/(I), 2), 3), 4), g(2, 4), /z(2), /z(3), /i(4)}. Withre- 
spect to the interpretation 7, and assuming that all variables are local, we have that: 

- #count{X : g{X, F)} > 2 is false, because Si for the corresponding ground set is 
{(1), (2)}, so 7(5) = [1, 2] and #count([l, 2]) = 2. 

- #count{X,F : g{X,Y)] > 2 is true, because 5/ = {(1, 2), (1, 3), (1, 4), (2, 4)}, 
I{S) = [1, 1, 1, 2] and #count([l, 1, 1, 2]) = 4. 

- 23 < #times{F : f{X), g{X,Y)} < 24 is true; in this case 5/ = {(2), (3), (4)}, 
7(5) = [2,3,4] and #times([2, 3, 4]) = 24. 

- #sum{A : g{A,B),h{B)} < 3 is true, as we have that 5/ = {(1), (2)}, 7(5) = [1,2] 
and#sum([l,2]) = 3. 

- #sum{A,B : ^(A, B), /i(B)} < 3 is false, since 5/ = {(1, 2), (1, 3), (1, 4), (2, 4)}, 
7(5) = [1, 1, 1, 2] and #sum([l, 1, 1, 2]) = 5.; 

- #min{X : f{X), g{X)} > 2 is false because the evaluation of (the instantiation of) 
{X : f{X),g{X)} w.r.t. 7 yields the empty set, which does not belong to the domain 
of #inin (we have that 7(#min{}) = ±). 

■ 

A ground rule r G GroundRules{V) is satisfied w.r.t. I if some head atom is true w.rt. 
7 whenever all body literals are true w.rt. 7. (If r is an integrity constraint, r is satisfied iff 
its body is false.) A ground weak constraint w £ GroundWC{V) is satisfied w.r.t. I if some 
body literal of w is false w.r.t. 7. 

^ Note that in the implemented system (cf. Section|5) an error will be produced if Lg or Rg are not in U:^ . 
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A model of V is an interpretation M of P such that every rule r G GroundRules{V) is 
satisfied w.r.t. M. A model M of P is (subset) minimal if no model NofV exists such that 
is a proper subset of mH 

Example 6 

Consider the aggregate atom A = #sum{(l : p{2, 1)), (2 : p{2, 2))} > 1 from Example|4] 
Let S be the ground set appearing in A. For the interpretation / = {q{2),p{2, 2), f(2)}, 
I{S) = [2], the application of #sum over [2] yields 2, and A is therefore true w.r.t. /, since 
2 > 1. Indeed, one can verify that / is a minimal model of the program of Example |4] ■ 

Answer Sets. We define the answer sets of DLP-^ programs in three steps, using their 
ground instantiation. First we define the answer sets of standard positive programs (i.e., 
programs without aggregates and without weak constraints), then we give a reduction of 
DLP-^ programs containing aggregates and negation as failure to standard positive ones 
and use it to define answer sets of arbitrary sets of rules, possibly containing aggregates 
and negation as failure. Finally, we specify how weak constraints affect the semantics, 
arriving at the semantics of general DLP^ programs (with negation, aggregates and weak 
constraints). 

Step 1 An interpretation X C B-p is an answer set of a standard positive DLP-^ program 
(without aggregates and weak constraints) V, if it is a minimal model of V. 

Example 7 

The positive program Pi = {aVbVc-} has the answer sets {a}, {b}, and {c}. Its extension 
V2 = {a V b V c- , :-«•} has the answer sets {b} and {c}. Finally, the positive program 
1^3 = {a V b V c- , :- fl- , b:-c- , c:-b-} has the single answer set {b, c}. ■ 

Step 2 The reduct or Gelfond-Lifschitz transform of a DLP-^ program P w.rt. a set 
X C Bj> is the standard positive ground program obtained from GroundRules{P) by 

• deleting all rules r G GroundRules{P) for which a negative literal in B{r) is false 
w.r.t. X or an aggregate literal is false w.rt. X; and 

• deleting all negative literals and aggregate literals from the remaining rules. 

An answer set of a program is a set X C Bj> such that X is an answer set of P^. 
Example 8 

Given the following aggregate-stratified program with negation P4 = 

{d{l)- , a\J b:-c- , 

b :- not fl, not c, T^countjF : d{Y)} > 0- , 
fl V c :- not b, #sum{}' : d{Y)} > !•} 

and / {b,d{l)}, the reduct P'^ is {d{l)- , aV b:-c- , b-}. It is easy to see that / is an 
answer set of P^, and thus an answer set of P4 as well. 

Now consider J ~ {a,d{l)}. The reduct P^ is {d{l)- , a V b:-c-}. It can be easily 

* Note that a model can violate weak constraints. 
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verified that 7 is a model of V^. However, also J' = {d{l)} C / is a model of V^, so J is 
not an answer set of V'l and thus J is not an answer set of V^. 

For K — {c, d{l)}, on the other hand, the reduct V4 is equal to V'l, but K is not an 
answer set of P^: for the rule r : aV b:-c, B{r) C K holds, but H{r) r\K ^ % does not. 
Indeed, it can be verified that / and J are the only answer sets of 7^4. ■ 

Step 3 Given a ground program V with weak constraints WC{V), we are interested in the 
answer sets of Rules{V) which minimize the sum of weights of the violated (unsatisfied) 
weak constraints in the highest priority leveljl and among them those which minimize the 
sum of weights of the violated weak constraints in the next lower level, etc. Formally, this 
is expressed by an objective function H'^{A) for V and an answer set A as follows, using 
an auxiliary function which maps leveled weights to weights without levels: 

fv{l) -1, 

fv(n) =fT,[n-l)-\WC{V)\-wl„, + l, « > 1, 
H'^i.A) = E'B'iifvii) ■ Evv6<(A) ^'^ightiw)), 

where w,^„j and l]^^^^ denote the maximum weight and maximum level over the weak con- 
straints in V, respectively, Nf (A) denotes the set of the weak constraints in level ; that 
are violated by A, and weight{w) denotes the weight of the weak constraint w. Note that 
|WC('P)| • w^jg^ + 1 is greater than the sum of all weights in the program, and therefore 
guaranteed to be greater than the sum of weights of any single level. 

Intuitively, the function /p handles priority levels. It guarantees that the violation of 
a single constraint of priority level / is more "expensive" then the violation of all weak 
constraints of the lower levels (i.e., all levels < /). 

For a DLP-^ program V (possibly with weak constraints), a set A is an (optimal) answer 
set of V if and only if (1) A is an answer set of Rules{V) and (2) H''^{A) is minimal over 
all the answer sets of Rules{V). 

Example 9 

Consider the following program V^, which has three weak constraints; 

a\Jb- 
by c- 

dV nd :- a, c- 

:-#sum{(4:fe)}>3- [1:2] 
:~ a, nd ■ [4 : 1] 
■.r^c,d- [3:1] 

RulesiVc,) admits three answer sets: Ai = {a, c. d}, A2 — {a, c, nd}, and A3 — {/?}. We 
have: H'^'- (Ai) = 3, H'^'- (A2) = 4, H'^'' (A3) = 13. Thus, the unique (optimal) answer set 
is {fl, c, d} with weight 3 in level 1 and weight in level 2. 



Higher values for weights and priority levels mark weak constraints of higher importance. The most important 
constraints are those having the highest weight among those with the highest priority level. 
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2.4 Computing New Values from Aggregates 



Due to the definition of safety in Section |Z2| we could define the semantics of aggregates 
using the standard Herbrand Base and Herbrand Universe. The values returned by aggre- 
gate functions do not extend the Herbrand Universe. 

This restriction, which is also imposed in the language of the Lparse system ( Syrjanen 2002| l 



(see also Section 17. 2t . appears to be severe and limits the expressiveness of the language. 
Suppose, for instance, that the employees of a company are stored by a number of facts of 
the form employ ee{Id, Name, Salary). If the boss wants to know the sum of the salaries 
she pays, a rule 

total(T) : T = ^sum{S,I : employ ee{I,N,S)}- 

would be most intuitively 

However, the above rule is unsafe because of the variable T. Our language thus fails to 
naturally express a simple query which can be easily stated in SQ10. To overcome this 
problem, we introduce the notion of assignment aggregate and make appropriate adjust- 
ments to the notion of safety and semantics. 

Assignment Aggregate. We denote by def''(p) the set of defining rules of a predicate 
p, that is, those rules r in which p occurs in the head. Moreover, the defining program of 
a predicate p, denoted by def''{p), consists of def''(p) and the defining programs of all 
predicates which occur in the bodies of rules in defip). 

An aggregate atom is an assignment aggregate if it is of the form X — f{S),f{S) ~ X, 
orX =/(5') X, where X is a variable and for each predicate p in S, def^ip) is negation- 
stratified and non-disjunctive. 

The intuition of the restriction on the definition of the nested predicates is to ensure that 
these predicates are deterministically computable. 



Relaxed Safety. We slightly relax the notion of safety as defined in Section lZ2l changing 
only condition (i): 

A rule or weak constraint r is safe if the following conditions hold: (i) each global 
variable of r appears in a positive unnested standard literal of the body of r or as a guard 
of an assignment aggregate; (ii) each local variable of r that appears in a symbolic set 
{Vars : Conj} also appears in a positive literal in Conj. Finally, a program is safe if all 
of its rules and weak constraints are safe. 

To adapt the formal semantics to this extension, we enrich the Universe U-p of the pro- 
gram by the set of positive integers which result from the evaluation of an aggregate func- 
tion, with a consequent enlargement of B-p. Note that the (relaxed) safety criterion guaran- 
tees domain independence of rules and weak constraints, which — together with aggregate 
stratification — guarantees a simple (and finite) evaluation. None of the remaining semantic 
notions needs further adaptations. 



We aggregate over I (in addition to S), as otherwise two employees having the same salary would count only 

once in the total. This is also why we allow for multisets. 

Note that also the language of Lparse caimot express this query, cf. Section lT!!) 
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3 Knowledge Representation in DLP-^ 

In this section, we show how aggregate functions can be used to encode several relevant 
problems: Team Building, Seating, and a logistics problem, called Fastfood. Moreover, 
we show how some properties of the input relations (e.g., the cardinaUty) can be simply 
computed by using aggregates, and we describe the encoding of a variant of the Fastfood 
problem. 

Team Building. A project team has to be built from a set of employees according to the 
following specifications: 

(pi) The team consists of a certain number of employees. 

(P2) At least a given number of different skills must be present in the team. 

(pa) The sum of the salaries of the employees working in the team must not exceed 

the given budget. 

(pi) The salary of each individual employee is within a specified limit. 

(p5 ) The number of women working in the team has to reach at least a given number. 

Information on our employees is provided by a number of facts of the form 

emp{EmpId, Sex, Skill, Salary) . The size of the team, the minimum number of different 
skills in the team, the budget, the maximum salary, and the minimum number of women 
are specified by the facts nEmp (A''), nSkill{N), budget{B), maxSal{M), andwomen{W). We 
then encode each property p, above by an aggregate atom A,, and enforce it by an integrity 
constraint containing not A, . 

% Guess whether to take an employee or not. 
in{I) V out{I) :- emp{I, Sx, Sk, Sa) ■ 
% The team consists of exactly employees, (pi) 
■.-nEmp{N),not #count{/ : in{I)} = N- 

% Overall, employees need to have at least M different skills, {p-2) 

:- nSkill{M), not #count{5A: : emp{I, Sx, Sk, Sa), in(I)} > M- 

% The sum of the individual salaries must not exceed the budget B. (ps) 

:- budget{B), not ^sum{5fl. / : emp{I, Sx, Sk, Sa), /«(/)} < B- 

% The max. salary in the team must not exceed the max. allowed salary M. (p4) 

■.-maxSal{M),not #max{5a : emp{I , Sx, Sk, Sa) , in{I)} < M- 
% We have at least W women in the team, (p^) 

■.-women{W),not #count{/ : emp{I,f,Sk,Sa),in{I)} > W- 

Intuitively, the disjunctive rule "guesses" whether an employee is included in the team or 
not, while the five constraints correspond one-to-one to the five requirements pi-p^. Thanks 
to the aggregates the translation of the specification is surprisingly straightforward. 

The example highlights the usefulness of representing both sets and multisets in our 
language; the latter can be obtained by specifying more than one variable in the Vars 
part of a symbolic set {Vars : Conj}). For instance, the encoding of p2 requires a set, 
as we want to count different skills: two employees in the team having the same skill 
count once w.r.t. p2- On the contrary, p^ requires to sum the elements of a multiset : if 
two employees have the same salary, both salaries should be summed up for p^. This is 
obtained by adding the variable /, which uniquely identifies every employee, to Vars. 
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The valuation of {Sa,I : emp{I , Sx, Sk, Sa) , in{I)} gives rise to the set S = {{Sa,I) : 
Sa is the salary of employee I in the team}. The sum function is then applied on the 
multiset of the first component Sa of all the tuples {Sa, I) in S (see Section l23]) . 

Seating. We have to generate a seating arrangement for k guests, with m tables and n 
chairs per table. Guests who like each other should sit at the same table; guests who dislike 
each other should sit at different tables. 

Suppose that the number of chairs per table is specified by nChairs{X) and that person (P) 
and table{T) represent the guests and the available tables, respectively. Then, we can gen- 
erate a seating arrangement by the following program: 

% Guess whether person P sits at table T or not. 

at{P, T) V notMt{P, T) :- person{P), table (T)- 

% The persons sitting at a table cannot exceed the number of chairs there. 
■.-table{T),nChairs{C),not #count{P : at{P,T)} < C- 
% A person is to be seated at precisely one table. 

person(P), not ^count {T : at{P,T)} — 1- 
% People who like each other should sit at the same table... 
:- like{Pl,P2), at{Pl, T),not at{P2, T)- 
% ...while people who dislike each other should not. 
:- dislike{Pl,P2),at{Pl, T), at{P2, T)- 

This encoding does not make as massive a use of aggregates as Team Building, but it is 
useful to highlight a readability issue, which also has impact on efficiency, as discussed in 
Section|6] The last aggregate atom above could be replaced by 

% A person cannot sit at two different tables... 
■.-person{P),at{P, T),at{P, Tl),T ^Tl- 
% ...and has to sit at one table at least. 

seated{P) :- at{P, T)- 

:~ person(P), not seated{P)- 

This is less concise and arguably less readable. Moreover, the number of ground rules 
and constraints necessary for expressing the same statement would grow from ^ * m to 
k * m * [m — 1) -\- k * m + k, where k is the number of guests and m the number of tables. 

Fastfood. The "Fast Food" problem, number 662 of volume VI of the ACM programming 
contests problem set archive ( http : / / acm . uva .es/p/v6/662. html ), is specified 
as follows: 

The fastfood chain McBurger owns several restaurants along a highway. Recently, they have de- 
cided to build several depots along the highway, each one located at a restaurant and supplying 
several of the restaurants with the needed ingredients. Naturally, these depots should be placed so 
that the average distance between a restaurant and its assigned depot is minimized. You are to write 
a program that computes the optimal positions and assignments of the depots. 

To make this more precise, the management of McBurger has issued the following specification: 
You will be given the positions of n restaurants along the highway as n integers d\ < d2 < ■ ■ ■ < d„ 
(these are the distances measured from the company's headquarter, which happens to be at the same 
highway). Furthermore, a number k{k < n) will be given, the number of depots to be built. 
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The k depots will be built at the locations of k different restaurants. Each restaurant will be as- 
signed to the closest depot, from which it will then receive its supplies. To minimize shipping costs, 
the total distance sum, defined as 



must be as small as possible. 

We assume that instances are given as facts of the form restaumnt{res, d) representing a 
restaurant uniquely named res at kilometer d of the highway. Moreover, a fact nDepots{k) 
is included which specifies k, the number of depots to be built. 

% A restaurant can be a depot or not. 

depot{Res,D) V notdepot(Res,D) :- restaurant{Res,D)- 

% The number of depots must be as specified. 

nDepots{K),not #count{Dep,D : depot{Dep,D)} = K- 
% Determine the serving depot for each restaurant. 
serves{Dep, Res, D) :- restaurant{Res, ResD) , depot{Dep, DepD), 
distance{ResD, DepD, D), 

#min{F : depot{Depl,DepDl),distance{DepDl,ResD, Y)} = D- 
% Minimize the serving distances. 
:~ serves {Dep, Res, D) ■ [D :] 
% Auxiliary predicate. 

distance{X, Y, D) :- restaurant {Resl,X), restaurant{Res2, Y),X > Y,X = Y + D- 
distance(X, Y, D) :- restaurant {Resl,X), restaurant{Res2,X),X < Y,Y = X + D- 

In the definition for distance, we have used atoms involving built-in predicates >, <, 
and which are defined on a bounded set of integer constants. That is, these predicates 
define greater than, less than or equal, and sum, respectively, on the finite set of integers 
[0,n]. For this example domain, each instance implies an upper bound for the integers that 
can occur, and we assume that the maximum integer n is chosen appropriately for each 
instance. Note that atoms like X = Y + D are quite different from assignment aggregates: 
For the former, an admissible value range has to be specified expUcitly (n for the integer 
range on which the predicate is defined), while for the latter a value range is not necessary. 

Note that this example involves minimization in two different ways: On the one hand, 
the serving distance for a restaurant is the minimum distance to a depot. On the other hand, 
we look for a solution which minimizes the sum of serving distances. It is important to 
note that the first minimum (choosing the closest depot for every restaurant) refers to a 
fixed depot assignment, whereas the second (minimizing the sum of serving distances) is 
to be determined with respect to all possible depot assignments. It is therefore not possible 
to merge the two criteria, and indeed we use different constructs (an aggregate and a weak 
constraint) for representing them. 

Input Cardinality. 

In several problems, it is important to determine the cardinality of input relations. Doing 
so is simple using an assignment aggregate: If the input predicate is p and has arity n, we 
can write 



n 




I di — (position of depot serving restaurant (') | 



cardinality 4? {C) :- #count{Xl, . . . ,Xn : p{Xl, . . . ,Xn)} = C- 
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Note that in general this can not be achieved without assignment aggregates as defined 
in Section 123] Without these, one could add some kind of domain predicate restricting the 
range of the variable C: 

cardinality ^{C) :- ^count{Xl, . . . ,Xn : p{Xl, . . . ,Xn)} — C, domain{C)- 

However, since the maximum cardinality of p is not known in advance, the size of domain 
would have to be countably infinite, which is not feasible. 

In a similar way, again by assignment aggregates, one may compute the sum of the 
values of an attribute of an input relation (e.g., compute the sum of the salaries of the 
employees). 

Fastfood Solution Checking. 

Consider a slight variation of the Fastfood problem introduced above: Instead of comput- 
ing the optimal solutions, one has to check whether a given depot assignment is optimal and 
compute a witness (a depot assignment with smaller distance sum) if it is not. This problem 
features in the First Answer Set Programming System Competitioillf (IGebser et al. 2007] !. 



Here, in addition to facts restaurant{res, d) (as in the Fastfood problem input), also facts 
depot{dep,d) will be in the input, representing the depot assignment to be checked for 
optimality. nDepots{k) is no longer part of the input. 

The encoding is an elaboration of the encoding for Fastfood. Here we define a predicate 
altdepot, which represents an alternative depot assignment. Such an assignment is a witness 
if its distance sum is less than the distance sum of the input depot assignment. 

% A restaurant can be an alternative depot or not. 

altdepot (Re s,D) V notaltdepot(Res , D) :- restaurant(Res , D) ■ 

% The number of alternative depots must be equal to the number of depots. 

:- #count{Dep,D : depot{Dep,D)} —N, 

not =fl=count{Dep,D : altdepot{Dep, D)} — N- 
% Determine the serving input depot for each restaurant. 
serves(Dep, Res, D) :- restaurant {Res, ResD) , depot{Dep , DepD), 

distance {Re sD, DepD, D), 

#min{y : depot{Depl,DepDl),distance{DepD\,ResD, Y)} = D- 
% Determine the serving alternative depot for each restaurant. 
altserves{Dep , Res, D) :- restaurant{Res, ResD) , altdepot{Dep, DepD), 

distance{ResD, DepD, D), 

^m±n{Y : altdepot{Depl, DepDl) , distance{DepDl, ResD, ¥)} — D- 
% Accept an alternative solution only if its supply costs are not greater or 
% equal than the supply costs for the input candidate. 
:~ ^sum{D,Res : serves{Dep,Res,D)} — Cost, 

^sum.{D, Res : altserves{Dep,Res, D)} > Cost- 
% Auxiliary predicate. 

distance{X, Y, D) :- restaurant{Resl,X), restaurant {Re s2, Y),X > Y,X — Y + D- 
distance{X, Y, D) :- restaurant{Resl,X), restaurant{Res2,X),X < Y,Y = X + D- 

^'^ See |http : / /asparagus . cs . uni-potsdam. de/ contest /| . 
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It should be noted that this encoding relies heavily on assignment aggregates. The first 
constraint determines the cardinality of the input predicate depot using an assignment ag- 
gregate and makes sure that any alternative assignment has the same cardinality. The final 
constraint also employs an assignment aggregate, in this case not directly involving an 
input predicate, but a predicate which has a deterministic definition {serves) and which 
involves yet another aggregate. In fact, it is unclear if and how this constraint could be 
encoded without an assignment aggregate, as the range for Cost is not known or bounded 
a priori. 

4 Computational Complexity of DLP-^ 

As for the classical non-monotonic formalisms ( |Marek and Truszczyriski 1991| l, two im- 
portant decision problems, corresponding to two different reasoning tasks, arise in DLP'^: 

Brave Reasoning: Given a DLP-^ program V and a ground literal L, is L true in 
some answer set of P? 

Cautious Reasoning: Given a DLP-^ program V and a ground literal L, is L true in 
all answer sets of P? 

The following theorems report on the complexity of the above reasoning tasks for propo- 
sitional (i.e., variable-free) DLP-^ programs that respect the safety restrictions imposed in 
Section m Importantly, it turns out that reasoning in DLP"^ does not bring an increase in 
computational complexity, which remains exactly the same as for standard DLP We begin 
with programs without weak constraints, and then discuss the complexity of full DLP-^ 
programs including weak constraints. 

Lemma 1 

Deciding whether an interpretation M is an answer set of a ground program V without 
weak constraints is in co-NP. 

Proof 

We check in NP that M is not an answer set of V as follows. Guess a subset / of M, and 
verify that: (1) M is not a model of V, or (2) / C M and / is a model of V'^ , the Gelfond- 
Lifschitz transform of V w.rt. M. 

The only difference w.r.t. the corresponding tasks of (1) and (2) in standard DLP is the 
computation of the truth valuations of the aggregate atoms, which in turn require to com- 
pute the valuations of aggregate functions and sets. Computing the valuation of a ground 
set T requires scanning each element (fi, • • •, f„ : Conj) of T and adding ti to the re- 
sult multiset if Conj is true w.rt. /. This is evidently polynomial, as is the application of 
the aggregate operators (#count, #min, #max, #sum, #times) on a multiset. The 
comparison of this result against the guards, finally, is straightforward. 

Therefore, the tasks (1) and (2) are tractable as in standard DLP. Deciding whether M 
is not an answer set of V thus is in NP; consequently, deciding whether M is an answer set 
of -P is in co-NP. □ 

Based on this lemma, we can identify the computational complexity of the main decision 
problems, brave and cautious reasoning. 
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Theorem 10 

Brave Reasoning on ground DLP-^ programs without weak constraints is Sj-complete. 
Proof 

We verify that a ground Uteral L is a brave consequence of a DLP-^ program V as follows: 
Guess a set M C B-p of ground atoms and check that (1) M is an answer set of V and (2) 
L is true w.rt. M. Task (2) is clearly polynomial, while (1) is in co-NP by virtue of Lemma 
[U The problem therefore lies in Ej- 

I]2-hardness follows from the Sj-hardness of DLP ( |Eiter, Gottlob, and Mannila 1997| ), 
since DLP-^ is a superset of DLP. □ 

The complexity of cautious reasoning follows by similar arguments as above. 

Theorem 11 

Cautious Reasoning on ground DLP-^ programs without weak constraints is n2-complete. 

Proof 

We verify that a ground literal L is not a cautious consequence of a DLP-^ program V as 
follows: Guess a set M C B-p of ground atoms and check that (1) M is an answer set of 
V and (2) L is not true w.r.t. M. Task (2) is clearly polynomial, while (1) is in co-NP, by 
virtue of Lemma[T| Therefore, the complement of cautious reasoning is in , and cautious 
reasoning is in IIj. 

Ha-hardness again follows from (lEiter and Gottlob 1995l l. since DLP-^ is a superset of 
DLP. □ 

From these results we can derive the results for DLP-^ with weak constraints. 
Theorem 12 

For a ground DLP-^ program V, deciding whether an interpretation M is an answer set is 
Ilj-complete, while brave and cautious reasoning are both Ag-complete. 

Proof 

The key to this proof is that one can rewrite each DLP-^ program V to another DLP-^ 
program W{V) in which no aggregates occur in weak constraints, by replacing each ag- 
gregate literal that occurs in a weak constraint by a new standard atom, and adding a rule 
with the aggregate literal in the body and the new atom in the head. 

Hardness for the result follows directly from item (3) of Theorem 4. 14 in (ILeone et al. 20061) . 
For membership, we show that deciding whether an interpretation M is not an answer set 
is Sf. We consider W{V) and M' , which is obtained from M by adding those new atoms 
that replaced aggregate literals that are true w.r.t. M. We first test whether M' is an answer 
set of Rules{W{V)), which is in co-NP by Lemma[T] If M' is not an answer set, we stop 
and return yes. Otherwise we determine the cost c of M' in polynomial time, and guess an 
M" C Bp. We check that M" is an answer set of Rules{W{V)) by a single call to an NP 
oracle, and check that the cost of M" is less than c in polynomial time. 

For the Ag results, hardness is an immediate consequence of Theorem 4. 8 in (ILeone et al. 2006l l. 
Membership can be shown exactly as in the proof of Theorem 4.8 in ( ILeone et al. 2006l l. 
using 'W{V) and the fact that the necessary oracle for determining whether an interpreta- 
tion is an answer set of V, the cost of which is less than a fixed bound, is also in this 
case, as argued above. □ 




Fig. 1. The System Architecture of DLV 



The above theorems confirm that our addition of aggregates to disjunctive logic pro- 
gramming does not cause any increase in the computational complexity of the language, 
and the same holds even if weak constraints are allowed. 

We end this section by discussing the complexity of non-ground programs. The prob- 
lems with respect to data-complexity for DLP-^ programs (i.e. a program V is fixed, while 
the input consists of a set of facts) have the same complexity as for propositional pro- 
grams. Concerning program complexity (i.e. a program V is given as input), complexity 
rises in a similar manner as for aggregate-free programs. A non-ground program V can 
be reduced, by naive instantiation, to a ground instance of the problem, the size of which 
is single exponential in the size of V. Informally, the complexity results thus increase ac- 
cordingly by one exponential, from co-NP to co-NEXPTIME, to NEXPTIME'^p, to 
co-NEXPTIME'^^, and A3 to EXPTIME^^ . These results can be derived using complexity 
upgrading techniques as presented in ( |Eiter, Gottlob, and Mannila 1997[|Gottlob et al. 1999I I. 



5 Implementation Issues 

In this section we illustrate the design of the implementation of aggregates in the DLV 
system. We first briefly describe the overall architecture of DLV, and we then discuss the 
impact of the implementation of aggregates in the system. 



5.7 DLV Architecture 

An outline of the general architecture of the DLV system is depicted in Figure[T] It includes 
four front-ends for solving domain-oriented tasks; these are implemented on top of the 
DLV core by means of suitable rewriting techniques to DLP. Clearly, the implementation 
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of aggregates does not affect these front-ends, even if the availability of the aggregates will 
allow to enhance the front-ends and improve the expressiveness of their languages. 

Instead, the implementation of aggregates heavily affects the DLV core, which we de- 
scribe next. Input data can be supplied by regular files, and also by relational databases. 
The DLV core then produces answer sets one at a time, and each time an answer set is 
found, the "Filtering" module is invoked, which performs post-processing (dependent on 
the active front-ends) and controls continuation or abortion of the computation. 

The DLV core consists of three major components: the "Intelligent Grounding", the 
"Model Generator", and the "Model Checker" modules that share a principal data structure, 
the "Ground Program". The "Ground Program" is created by the "Intelligent Grounding" 
using differential (and other advanced) database techniques together with suitable data 
structures, and used by the "Model Generator" and the "Model Checker". The Ground 
Program is guaranteed to have exactly the same answer sets as the original program. For 
some syntactically restricted classes of programs (e.g. stratified programs), the "Intelligent 
Grounding" module already computes the corresponding answer sets. 

For harder problems, most of the computation is performed by the "Model Genera- 
tor" and the "Model Checker". Roughly, the former produces some candidate answer sets 
(models) ( |Faber, Leone, and Pfeifer 1999[ IFaber et al. 2001l l. the stability and minimality 
of which are subsequently verified by the latter 

The "Model Checker" (MC) verifies whether the model at hand is an answer set. This 
task is very hard in general, because checking the stability of a model is known to be co-NP- 
complete. However, MC exploits the fact that minimal model checking — the hardest part 
— can be efficiently performed for the relevant class of head-cycle-free (HCF) programs 
(|Ben-Eliyahu and Dechter 1994|lLeone et al. 1997ll. 



5.2 Implementation of Aggregates in DLV 

Implementing aggregates in the DLV system, has had a strong impact on DLV requiring 
many changes to the modules of the DLV core, and, especially, to the "Intelligent Ground- 
ing" (IG) and to the "Model Generator" (MG) modules. We next describe the main changes 
carried out in the modules of DLV core to implement aggregates. 



5.2.7 Intelligent Grounding 

The changes performed in the Intelligent Grounding module to implement aggregates in 
DLV can be summarized in three main activities: Standardization, Instantiation Procedure 
(the main task), and Duplicate Sets Recognition. 

Standardization. After parsing, each aggregate A is transformed such that both guards 
are present and both and ^2 are set to <. The conjunction Conj of the symbolic set of 
A is replaced by a single, new atom Aux and a rule Aux :- Conj is added to the program 
(the arguments of Aux being the distinct variables of Conj). 

Instantiation Procedure. The goal of the instantiator is to generate a ground program 
that has precisely the same answer sets as the theoretical instantiation Ground{V), but 
is as small as possible. The instantiation of standard DLV proceeds bottom-up following 
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the dependencies induced by the rules, and, in particular, respecting the ordering imposed 
by negation-stratification where this is possible. DLV's instantiator produces only those in- 
stances of a predicate which can potentially become true ( |Faber, Leone, Mateis, and Pfeifer 1999t 
ILeone et al. 2001l l. where a ground atom A can potentially become true only if we have 
generated or may generate a ground instance of a rule with A in the head. Ground atoms, 
which have determined to be true or false in any answer set, are instead partially evalu- 
ated, that is if a literal it occurs in is true, that literal is omitted from the ground rule to be 
generated; if that literal is false, the ground rule it would occur in will not be generated. 

For programs containing stratified aggregates strategy is extended such that the order 
of processing respects aggregate stratification. In this way, any truth-values (true, false or 
potentially true) of nested atoms, which can be determined during grounding, have akeady 
been determined before the aggregate atom itself is instantiated. 

When processing a rule containing an aggregate atom we proceed as follows. Assume 
that "H :- B, aggr" is the rule r which is to be processed, where H is the head of the rule, 
B is the conjunction of the standard body literals in r, and aggr is a standardized aggregate 
literal over a symbolic set {VarsAux}. First we compute an instantiation B for the literals 
in B; this also binds the global variables appearing in Aux. The (partially bound) atom Aux 
is then matched against its extension (which is already available as the computation follows 
aggregate-stratification as discussed above), all matching facts are collected, and a set of 
pairs 

{{ei{Vars):ei(AiIx)),- ■ •, {en{Vars):e„(AiIx))} 

is generated, where 6j is a substitution for the local variables in Aux such that Oi{Aux) is a 
potentially true instance of Aux. For all a{Aux) which are true or false instances of Aux, the 
aggregate is partially evaluated, which is done by methods that depend on the aggregate 
function involved. Note that in this way aggregates will only ground atoms the truth-value 
of which can not be determined already by the instantiator. 

Note that for several classes of programs, the instantiator guarantees complete evalua- 
tion. If a predicate is defined by a subprogram of such a class, no ground atom of it will be 
generated. In particular, if the predicate Aux of a standardized aggregate is defined by such 
a program, the aggregate function can be fully evaluated by the instantiator. One notable 
class, for which this is possible, are non-disjunctive negation-stratified programs. There- 
fore, by the definition of assignment aggregates, the value of the aggregate function inside 
an assignment aggregate can always be determined by the instantiator, thus providing a 
binding for the assigned variable (or no binding if the function evaluates to _L). An as- 
signment aggregate thus is treated like a unary positive atom which has at most one true 
matching instance. 

If a non-assignment aggregate literal can be fully evaluated by the instantiator, its truth- 
value will be determined by computing the value of the aggregate function and comparing 
it to the guards. If it evaluates to true, it is removed from the ground rule, if it evaluates to 
false, the ground rule is simply discarded, thus partially evaluating the ground rule an the 
aggregate literal. 

The same process is then repeated for all further instantiations of the literals in B. 
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Consider the rule r: 

p{X) :-q{X),l< #count{y : a(X, Y),notb{Y)}- 

The standardization rewrites r to: 

p{X) :- q{X), 2 < #count{y ; aux{X, Y)} < oo- 
aux{X, Y) :- a{X, Y), not fc(F)- 

Suppose that the instantiation of the rule for aux generates 3 potentially true facts aux{l,a), 
aux{l,b), and aux{2, c). If the potentially true facts for q are q{l) and ^(2), the following 
ground instances are generated: 

p{l) :-<7(l), 2< #count{(fl:flMx(l,fl)}, {b:aux{l,b))}<oo- 
p{2) ■.-q{2),2< #count{(c : aux{2,c))} < oo- 

Note that a ground set contains only those aux atoms which are potentially true. ■ 

Duplicate Sets Recognition. To optimize the evaluation during instantiation and espe- 
cially afterward, we have designed a hashing technique which recognizes multiple occur- 
rences of the same set in the program, even in different rules, and stores them only once. 
This saves memory (sets may be very large), and implies a significant performance gain, 
especially during model generation where sets are frequently manipulated by the back- 
tracking process. 

Example 14 

Consider the following two constraints: 

ci ■■ :- 10 < #max{y : d{V,X)}- 
02-. :- #min{y : d{Y,Z)} < 5- 

Our technique recognizes that the two sets are equal, and generates only one instance which 
is shared by ci and C2. 

To see the impact of this technique, consider a situation in which the two constraints 
additionally contain another standard Uteral p{T): 

C3: :- p(r),10 < #max{y : c/(y,X)}- 
C4: :- p{T),#min{Y -.diY,!)} <5- 

Here, C3 and C4 have n instances each, where n is the number of potentially true atoms 
matching p{T). By means of our technique, all instances of the constraint atoms in C3 and 
C4 share one common set, reducing the number of instantiated sets from 2 * n to 1. ■ 



5.2.2 Model Generator 

In our implementation, an aggregate atom will be assigned a truth- value just like a stan- 
dard atom. However, different from a standard atom, its truth-value also depends on the 
valuation of the aggregate function and thus on the truth-value of the nested predicates. 
Therefore, an aggregate atom adds an implicit constraint on models and answer sets: The 
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truth-value assigned to the aggregate atom must correspond to the truth-value obtained by 
the valuation. 

We have designed an extension of the Deterministic Consequences operator of the DLV 
system ( Faber, Leone, and Pfeifer 1999[IFaber 2002l l for DLP-^ programs which accounts 
for these additional implicit constraints. As for rules, we differentiate between "forward 
propagation" (when an aggregate atom is assigned a truth-value because of the valuation 
of its aggregate function) and "backward propagation" (when a nested atom is derived in 
order to make the valuation of the aggregate atom compliant with its assigned truth-value). 

We have extended the Dowling and Gallier algorithm ( jPowling and Gallier 1984) l (in 
the version of (IMinoux 19881 1 ) to deal with aggregates, and we compute the fixpoint of 
the enhanced Deterministic Consequences operator in linear time. To achieve this, we have 
endowed aggregate atoms with datastructures similar to those used in rules. In particular, all 
aggregate atoms have a lower and upper bound holding the minimum and maximum value 
of the aggregate function w.rt. the interpretation at hand to efficiently determine whenever 
an aggregate atom becomes true or false. #min and #max hold additional values for 
differentiating between undefined and true nested atoms. Moreover, for each standard atom 
we keep an index of aggregate sets in which it occurs to update these counters in an efficient 
way. 

Forward propagation can then be achieved comparatively easily: whenever a standard 
atom is assigned a truth-value (other than undefined), the bounds and additional data of all 
aggregate functions it occurs in are updated. Where the bound range is fully covered by the 
guard range, the aggregate atom is derived as true. If the bound range and the guard range 
do not intersect, it is derived as false. For backward propagation, whenever an aggregate 
atom gets a truth-value other than undefined or a non-undefined aggregate atom has an 
update of its bounds, several checks for inferences are performed, dependent on the type 
of aggregate function. For example, if there exists a tuple {k, . . . : a) in a ground #sum 
aggregate which is true, such that a is undefined and the lower bound plus k is greater than 
the upper guard, then a can be derived as false. In order to make these checks efficient, the 
set of entries in the ground aggregate set is stored in a structure which is ordered on the 
projected term. 

Example 15 

Let us consider some of the propagations that are done for the following ground program. 

fl(l)Vfo(l)- a{2)\fb{2)- 
:-#sum{(l:fl(l)),(2:fl(2))} < 3- 
ci:-#count{(l:a(l)),(2:fl(2))} > 2- 
c{l):-cs- c(2) Vc(3):-ci- :-c(l),J(l)- 
fl'(2):-#min{(l:c(l)),(2:c(2)}, (3:c(3))} < 2- 
fl'(l):-#max{(l:c(l)),(2:c(2)}, (3:c(3))} > 3- 

At the very beginning, the internal datastructures of the aggregate atoms are initialized. 
#sum{(l : (2 : fl(2))} < 3 gets bounds [0, 3] and guards [0, 2] (the guards are nor- 

malized to be inclusive). In a similar way, 7^count{(l :«(!)), (2 :fl(2))} > 2 gets bounds 
[0,2] and guards [2,oo]. #min{(l : c(l)}, (2 : c(2)), (3 : c(3))} < 2 is initialized with 
bounds [— cxo, +oo] (because the value of the aggregate function may become undefined) 
and guards [0, 1], and in addition minTrue = oo and minUndef — 1 for keeping track 
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of possible minima. In a similar way, ^max{{l : c(l)), (2 : c(2)), (3 : c(3)}} > 3 is ini- 
tialized with bounds [— oo, +c»], guards [3, oo], and special values maxTrue — — oo and 
maxUndef = 3. 

In the first step, #suin{(l : fl(l)), (2 : a{2))} < 3 is derived false in order to satisfy the 
first constraint. In order to look for possibilities for backward propagation, the elements of 
the multiset are examined in a descending order, beginning with the largest one. For each 
element, we check whether the bound minus the element value is less than or equal to the 
upper guard, as the condition of these elements must become true. So for (2 : a(2)), we 
obtain 3 — 2 < 2 and we make a derivation establishing the fact that a (2) must be true. In 
a similar manner, we obtain that a(l) must be true. Since both a{l) and a(2) each occur 
in a single rule head, they are derived as definitely true, being supported by the respective 
rule, which in turn causes b{l) and b{2) to be derived as false. Moreover, the truth of a{2) 
causes the bounds of 7^count{(l : a{l)), (2 : fl(2))} > 2 to become [1, 2], which due to 
the truth of a{l) then become [2, 2], causing the aggregate atom to become true. 

As a consequence, also cs and c(l) become true, while c(2) and c(3) remain undefined. 
So in #min{(l :c(l)}, (2:c(2)), (3:c(3)}} < 2, minTrue becomes 1, while minUndef 
becomes 2, so its bounds become [1,1], and the aggregate atom becomes true, causing also 
d{2) to become true. For ^maK{{l : c(l)), (2 : c(2)}, (3 : c(3))} > 3, maxTrue becomes 
1 and maxUndef becomes 3, causing the bounds to become [1,3]. 

Moreover, d{l) becomes false because of the constraint c{l),d{l)- Therefore, the 
aggregate atom #max{(l : c(l)), (2 : c(2)), (3 : c(3))} > 3 is derived as false. We then 
examine the elements of the multiset, starting with the greatest. If a condition of the element 
is undefined and its value is between the guards (inclusively), that condition must become 
false. In our example, for (3 : c(3)) this holds (the guards are [3, oo]) and so we derive the 
falsity of c(3). For (2 : c(2)) we cannot do this, as 2 is not within the guards. Indeed, c(2) 
is eventually derived true in order to satisfy the rule c(2) V c(3) cs. 

In this example, the Deterministic Consequence operator has thus already determined 
the answer set {fl(l), a(2), cs, c(l), c(2), d{2)}, as no undefined atoms are left. 



5.2.3 Model Checker 

The stratification constraint that we have imposed on DLP-^ aggregates, allows us to treat 
aggregate literals as negative literals in the reduct (see Section 12.3b . and minimize the 
impact of aggregates on answer set checking. 

The Model Checker (MC) receives a model M in input, and checks whether M is an 
answer set of the instantiated program V (see Subsection lS.lb . To this end, it first computes 
the reduct V'^ , by (i) deleting the rules having a false aggregate literal or a false negative 
literals (w.rt. M) in their bodies, and (ii) removing the aggregates literals and the negative 
literals from the bodies of the remaining rules. Since the resulting program is aggregate- 
free, the standard DLV techniques can then be applied to check whether V'^ is an answer 
set. Thus, no further change is needed in MC, after the modification of the procedure 
computing the reduct. 
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6 Experiments and Benchmarks 

6.1 Compared Methods, Problems and Data 

To assess the usefulness of the proposed DLP language extension and evaluate its imple- 
mentation, we compare the following two methods on some relevant benchmark problems: 

DLV-^ Encode each problem in DLP-^ and solve it using our extension of DLV 
with aggregates. 

DLV Encode the problem in standard DLP and solve it using standard DLV. 

To generate DLP encodings from DLP-^ encodings, suitable logic defi- 
nitions of the aggregate functions are employed (which are recursive for 

#count, #sum, and ^/^times). 

We compare these methods on three benchmark problems: Time Tabling, Seating, and 
Fastfood. Time Tabling is a classical scheduling problem. In particular, we consider the 
problem of scheduling the timetable of lectures which some groups of students have to take 
using a number of real-world instances from the University of Calabria where instance k 
deals with k groups of students. 

Seating is the problem described in Section [3] We consider four (for small instances 
with at most four tables) or five (for larger instances with at least five tables) seats per 
table, with increasing numbers of tables and persons (and numPersons = numSeats * 
numTables). 

For each problem size (i.e., seats per tables/tables configuration), we consider classes 
with different numbers of hke and dislike constraints, where the percentages are relative to 
the maximum numbers of like and dislike constraints, respectively, such that the problem 
is not over-constrainedl^ In particular, we consider the following classes: 

• no like/dislike constraints at all; 

• 25% like constraints; 

• 25% like and 25% dislike constraints; 

• 50% like constraints; 

• 50% like and 50% dislike constraints. 

For each problem size, we randomly generated 10 instances for each of these classes, 50 
instances for each problem size overall. 

We use the DLP-^ encoding reported in Section [3] All encodings and benchmark data 
are available on the web at http : / /www . dlvsystem . com/examples / in the files 
aggregates-timetabling.zip, aggregates-seating.zip, and aggregates-fastfood.zip. 

Fastfood is the problem described in Section [3] The concrete instances consist of ser- 
vice station data of the company "Tank&Rast" which runs the majority of service sta- 
tions on German motorways. This data has been obtained from the company website 
|http : / / www . rast . de/ standorte/| 

The instances are grouped by motorway and vary over the number of depots to be built, 
ranging from to the total number of restaurants along the motorway. The maximum length 
of any motorway is 910 kilometers, the maximum number of restaurants per motorway is 
49. 



Beyond these maxima there is trivially no solution. 
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Table 1 . Experimental Results for Timetabling 
Number of Execution Time 
Groups DLV DLV-^ 



Instantiation Size 



DLV DLV-^ 



2 
3 
4 
5 
6 
7 
8 
9 



3.45 
12.40 
32.63 
59.49 
90.93 
129.44 
153.30 
216.12 



0.22 
0.77 
1.57 
2.73 
4.18 
5.76 
7.98 
11.70 
16.51 



91337 
178756 
265250 
367362 
437018 
519568 
607099 
762026 
944396 



7092 
14209 
21200 
29377 
36517 
43385 
50731 
62513 
74772 



6.2 Results and Discussion 



We ran all benchmarks on an Intel dual Xeon 3GHz machine, using Debian GNU/Linux 
sarge with kernel version 2.4.27 and DLV release 2006-07-14. We allowed a maximum 
running time of 1800 seconds per instance and a maximum memory usage of 256MB. 

Cumulated results for Timetabling and Seating are provided in Tables [T] and |2] respec- 
tively. For Timetabling we report the execution time and the size of the residual ground 
instantiation (the total number of atoms occurring in the instantiation, where multiple oc- 
currences of the same atom are counted separately and atoms occurring in the sets of the 
aggregates are considered, too). For Seating, the execution time is the average running time 
over all instances of the same size. 

A "-" symbol in the tables indicates that the corresponding instance (some of the in- 
stances of that size, for Seating) was not solved within the allowed time and memory 
limits. 

On both problems, DLV-^ clearly outperforms DLV. On Timetabling, the execution 
time of DLV'^ is one order of magnitude lower than that of DLV on all problem instances, 
and DLV could not solve the last instances within the allowed memory and time limits. 
On Seating, the difference became even more significant. DLV could solve only instances 
of small size (up to 16 persons - 4 tables, 4 seats per table), while DLV-^ could solve 
significantly larger instances in reasonable time. 

The data on the instantiation sizes provides an explanation for the large difference be- 
tween the execution times of DLV and DLV-^. Thanks to aggregates, the DLP-^ encod- 
ings of Timetabling and Seating are far more succinct than the corresponding encodings 
in standard DLP. This also reflects in the ground instantiations of the programs. Since the 
evaluation algorithms are exponential in the size of the instantiation (in the worst case), the 
execution times of DLV-^ turn out to be much shorter than those of DLV. 

For Fastfood, we report only on motorways yielding hard instances in Figure|2] We have 
omitted the graph for A7, as it is very similar to that of A3. In each graph, the horizontal 
axis represents the number of depots to be built, while the vertical axis stands for exe- 
cution time. For all motorways, we can observe an interesting easy-hard-easy pattern with 
increasing number of depots. This is expected, as most possibilities for placing depots exist 
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Table 2. Experimental Results for Seating 



Number of 
Persons 



Exec. Time 
DLV DLV-' 



Instantiation Size 
DLV DLV-^ 



8 


0.01 


0.01 


228 


72 


12 


0.0155 


0.01 


710 


176 


16 


10.294 


0.01 


1621 


348 


25 




0.01 


4744 


960 


50 




0.0505 


35779 


5443 


75 




0.1869 


118167 


15744 


100 




0.5371 


277035 


34221 


125 




1.2619 


537635 


63358 


150 




2.6204 


925055 


105476 


175 




4.854 


1464260 


162773 



when the number of depots to be built is about half of the number of restaurants. We also 
observe that the average execution times strongly depend on the number of restaurants. 

It is easy to see that the encoding greatly benefits from the use of aggregates: Whenever 
there are instances that cannot be solved within the time limit, the version with aggregates 
manages to solve strictly more instances without timing out. Also when looking at the 
amount of time needed, the version with aggregates is always faster, and the advantage 
becomes more pronounced with rising difficulty of the instances, yielding speedups of up 
to 3 • 5. The computational benefits for this problem are not as dramatic as for Timetabling 
and Seating, but still quite notable. 



7 Related Work 

Aggregates have been studied fairly extensively in the context of databases and logics 
for databases, see dHella et al. 20011 1 for a summary. The logics studied in this setting are 
typically first-order logic endowed with some sort of aggregation operators, which are 
used to express queries. In such logics there is no concept of recursive definitions, and the 
aggregations therefore occur in a stratified way. Moreover, as shown in dHella et al. 20011) . 
the expressivity of these languages suffers from similar limitations as standard first-order 
logics for query answering. 

Aggregate functions in logic programming languages appeared already in the 1980s, 
when their need emerged in deductive databases like LDL dChimenti et al. 1990] l and were 



studied in detail, cf. (Ross and Sagiv 1997 Kemp and Ramamohanarao 1998 1. However, 



the first implementation in Answer Set Programming, in the Smodels system, has been 
fairly recent dSimons et al. 2002T l. 



7.1 Aggregate-Stratification 

The discussion on the "right" semantics for aggregate-unstratified programs is still going 

on in the DLP and Answer Set Programming (ASP) communities. Several proposals have 

been made in the literature, which can roughly be grouped as follows: In ( [Eiter, Gottlob, and Veith 1997t 

IGelfond 20021 IDeirArmi et al. 2003]) . aggregate atoms are basically treated like negative 
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literals. In (INiemela et a l. 1999^. only aggregates involving cardinality and sum are con- 
sidered; as argued in (jFerrans and Lifschitz 2005 ) this semantics is not intuitive for ag- 
gregates which are not monotonic, such as sum aggregates involving negative summands. 
In jPelov 20041 IPelov et al. 20041 l. a family of semantics, which extend completion, sta- 
ble and well-founded semantics, is defined by means of operator fixpoints, approxima- 
tions and transformations; a very similar approach has been given in (ISon et al. 20051 1 and 
(ISon and Pontelh 2007b . In jFaber et al. 2004b . a semantics based on a modified program 
reduct has been defined, for which alternative characterizations have been provided in 
(IFerraris 20051 ICaUmeri et al. 20051 IFaber 2 005 ). All of these four groups of semantics 
differ on certain language fragments; but they coincide on aggregate-stratified programs. 
Finally, in jMarek et al. 2004 | |Liu and Truszczynski 2005| l, semantically restricted aggre- 
gates are considered, on which the newer proposals coincide; but still the first group of se- 
mantics dEiter, Gottlob, and Veith 1997[IGelfond 2002llDeirArmi et al. 20031 differs even 
on these programs. To illustrate the difficulties with unstratified aggregates, we look at a 
simple example: 

Example 16 

Consider the (aggregate-unstratified) program consisting only of the rule 

p{a) :- #count{X : p{X)} = 0- 

As neither {p{a)} nor is an intuitive meaning for the program, one would expect that 
this program admits no answer sets. In this case, the role of the aggregate literal is similar 
to a negative literal. And indeed, approaches like dPelFArmi et al. 20031 IGelfond 20021 
[Eiter, Gottlob, and Veith 1997b treat aggregates like negative literals. 

However, consider a slight modification of this program, containing only the rule 

p{a) :- #count{X : p{X)} > 0- 

If the aggregate is treated like a negative literal, this program allows for two answer 
sets {pia)} and 0. Other approaches dPelov 20041 IFaber et al. 20041 IMarek et al. 20041 



Liu and Truszczynski 2005 1 try to maintain subset minimality and therefore differ on this 



program with respect to jDeirArmi et al. 2003HGelfond 2002l|Eiter, Gottlob, and Veith 1997| ). 
We conclude that this program does not have a semantics which is generally agreed upon. 

Our policy, in the development of DLV, is to keep the system language as much agreed- 
upon as possible, and to try to guarantee a clear and intuitive semantics for the newly intro- 
duced constructs. Thus, we disregard programs which are not aggregate-stratified, leaving 
their introduction in DLV to future workPl 

In addition, we observe that unstratified aggregates may cause a computational overhead. 
For instance, the complexity of brave and cautious reasoning on normal programs without 
weak constraints jumps from NP and co-NP to and IIj, respectively, if unstratified 
aggregates are allowed ( IFerraris 20051 ICalimeri et al. 20()5] l. while it remains in NP and 
co-NP if aggregates are stratified. 



Note that the limitation to aggregate-stratified programs is justified also from philosophical perspectives. For 
instance, defining a class q before defining subsets of q has been recommended by Zermelo, but we will not go 
into details of this aspect. 
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7.2 Comparison to the Language of Lparse 



Very related to DLP-^ is without doubt the language of Lparse ( [Syrjanen 2002 1, which 



serves as a grounding frontend to systems like Smodels (ISimons et al. 2002l l. Cmodels 
(ILierler 20051) . or clasp jGebser et al. 200'7b . which deal with aggregates. We observe a 
strong similarity between cardinality constraints and =i^count, as well as weight con- 
straints and #sum, respectively. Indeed, the DLP-^ encodings of both Team Building and 
Seating can be easily translated to the language of Lparse. However, there are several rele- 
vant differences. 

DLP-^ aggregates like #min, #max, and ^^times do not have a counterpart in the 
language of Lparse. Moreover, DLP-^ provides a general syntactic framework into which 
further aggregates can be easily included. 

In DLP'^ aggregate atoms can be negated, while cardinality and weight constraints in 
the language of Lparse cannot. Negated aggregates are useful for a more direct knowledge 
representation, and allow to express, for instance, that some value should be external to a 
given range. For example, not 3 < #count{X : p{X)} < 7 is true if the number of true 
facts forp is in [0, 3[U]7, oo[; for expressing the same property in Smodels one has to write 
two cardinality constraints. 

Furthermore, note that symbolic sets of DLP-^ directly represent pure sets of term tu- 
ples, and by means of projection DLP"^ can also represent multisets naturally (see the 
discussion on Team Building in Section[3]i. In contrast, cardinality constraints work on sets 
of ground atoms, rather than multisets of terms. For instance. Condition p2 of Team Build- 
ing in Section [3] cannot be directly encoded in a constraint, but needs the definition of an 
auxiliary predicate. Weight constraints, on the other hand, work exclusively on multisets 
of numbers, and additional rules are needed to encode pure sets. 

The language of Lparse requires that each variable has to occur in a positive atom formed 
by a domain predicate which must not be recursive with a head atom — by default a domain 
predicate must be defined by an aggregate-free program. It follows that the language of 
Lparse has no equivalent to assignment aggregates, which prohibits the definition of simple 
concepts such as determining the cardinality of input relations, as discussed in Section[3j 

The language of Lparse does however allow for cardinality and weight constraints in the 
heads of rules, while DLP-^ aggregates may only occur in rule bodies. The presence of 
weight constraints in heads is an interesting feature, which allows, for instance, to "guess" 
an arbitrary subset of a given set. But it causes the loss of some semantic properties of non- 
monotonic languages, see jMarek and Remmel 20021) . Lparse rules having cardinality and 
weight constraints in the head can however be expressed in DLP-^ in the following way: 
The atom to be aggregated over is put into a disjunctive head, which also contains a copy 
of this atom in which the predicate symbol is replaced by a fresh one, keeping the body 
of the original rule augmented by the "domain atom" of the constraint atom. Moreover, an 
integrity constraint is generated, which contains the negated constraint atom (transformed 
into a corresponding DLP-^ aggregate atom) from the original rule head and the body 
of the original rule. Transforming an Lparse program in this way to a DLP-^ program 
(replacing also cardinality and weight constraint atoms in rule bodies by corresponding 
DLP-^ aggregate atoms), the answer sets of the resulting DLP-^ program without atoms 
containing the fresh predicates are precisely the answer sets of the original Lparse program. 
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Moreover, the language of Lparse does allow for aggregate-unstratified programs, with 
the intended semantics of (Niemela et al. 1999] l. As discussed earlier, there is currently no 
consensus about the semantics of aggregate-unstratified programs, and indeed the seman- 
tics of (Niem ela et al. 1999] l has been criticized to yield unintuitive results when weight 
constraints over signed integers are present (IFerraris and Lifschitz 2005l l. 

7.3 Comparison to the language of Smodels'* 

More recently, the system SmodelS'^ has been described in jEUcabani et al. 20051) . Its lan- 
guage is an extension of the language of Lparse which allows for aggregates (possibly not 
aggregate-stratified) under the semantic described in jSon et al. 20051 1. which coincides on 
the semantics of DLP-^ on aggregate-stratified programs. 

The syntax of the additional aggregate constructs allowed in S MODELS'* is more similar 
to the one of DLP"^ (compared to those of Lparse, which are also available in Smodels'*), 
and allows for sum, count, min, max, and also avg, while times is currently not supported. 
In this sense, the Smodels'* can be considered the system, which is most similar to DLP. 

There is, however, one rather crucial difference in the aggregate syntax of SmodelS'*: 
There may be only one term to be aggregated over This means that, for example, the 
following DLP"^ rule has no counterpart in terms of the new aggregate constructs in 
Smodels'*. 

tooexpensive ^sura{Cost, Item : order{Item,Cost)}> ICQ- 

The intended meaning of this rule is that tooexpensive should be derived when the sum 
of the costs of all ordered items exceeds a threshold of 100. Note that here we specified 
two terms to be aggregated over, where the sum will be computed over the first one. This 
is important, as different items may incur the same cost. For instance if order{valve, 60) 
and order(pipe, 60) hold, then tooexpensive should be derived. One may try to write the 
following variant in the syntax of S MODELS'*: 

tooexpensive :- sum{Cost, order{Item, Cost)) > 100- 

However, when order{valve, 60) and order{pipe, 60) hold, tooexpensive would not be de- 
rived, as 60 is summed only once. Currently, there does not seem to be any way of circum- 
venting this problem with the aggregates introduced by S MODELS'*. 

Actually, there is a second problem with the rule mentioned above in the current version 
of S MODELS'*. The way in which the preprocessing is done requires that each variable 
in the aggregate atom is domain restricted by an atom outside the aggregate. In this rule, 
the condition is not met, but it is not possible either to add an atom outside the aggregate 
for domain restricting Item without changing the semantics of the rule. However, in many 
cases these problems can be overcome by writing an equivalent weight constraint in the 
language of Lparse, which are also available in SmodelS"*. 

Other differences between the language of Smodels'* and DLP-^ are that aggregate 
atoms may not occur negated, that all variables must be domain restricted, that each rule 
may contain only one aggregate and that assignment aggregates are not permitted. More- 
over, there is currently no possibility to specify a conjunction of literals (rather than a 
single atom) inside an aggregate atom in S models'*; but one can fairly easily circumvent 
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this limitation by replacing the conjunction by a new atom which is then defined by an 
appropriate rule. These differences are similar to the differences between DLP'^ and the 
language of Lparse. Moreover, given that SmodelS'* relies on Smodels as an engine, it 
also does not support disjunctive rules under the semantics of DLP-^. 

On the system level, the architecture of Smodels'* considerably differs from the one of 
DLV-^. It first preprocesses the input using an algorithm implemented in Prolog, yielding 
an intermediate program. This program is then submitted to Lparse. The output of Lparse 
is subsequently processed by a transformation algorithm (also implemented in Prolog), 
whose output is then submitted to Lparse another time. Finally, Smodels is called on the 
output of the second Lparse invocation to compute the answer sets. The key idea of the 
system is to compile away the aggregates, creating new rules or constraints, which emulate 
the aggregate atoms. In contrast, in the implementation of DLV'^, aggregates are first- 
class citizens and all the internal algorithms of DLV have been updated in order to deal 
with aggregates. 

8 Conclusion 

We have proposed DLP'^, an extension of DLP by aggregate functions count, sum, times, 
min, and max, and have implemented this in the DLV system. On the one hand, we 
have demonstrated that the aggregate functions increase the knowledge modeUng power 
of DLP, supporting a more natural and concise knowledge representation. On the other 
hand, we have shown that aggregate functions do not increase the complexity of the main 
reasoning tasks. In fact, experiments have confirmed that the succinctness of the encodings 
employing aggregates has a strong positive impact on the efficiency of the computation. 

Future work will concern the introduction of further aggregate operators like #any ("Is 
there any matching element in the set?") and ^avg, investigations of a general framework 
that will allow adding further aggregates much more easily, extending semantics to classes 
of programs which are not aggregate- stratified, as well as the design of further optimization 
techniques and heuristics to improve the efficiency of the computation. 

Acknowledgements 

This work has greatly benefited from interesting discussions with and comments by Paolo 
Ferraris, Michael Gelfond, Vladimir Lifschitz, Nikolay Pelov, and from the comments and 
suggestions by the anonymous referees. It was partially supported by M.U.R. under the 
PRIN project "Potenziamento e Applicazioni della Programmazione Logica Disgiuntiva", 
and by M.I.U.R. under internationalization project "Sistemi basati sulla logica per la rap- 
presentazione di conoscenza: estensioni e tecniche di ottimizzazione". Wolfgang Faber's 
work was funded by an APART grant of the Austrian Academy of Sciences. 

References 

Apt, K. R., Blair, H. A., and Walker, A. 1988. Towards a Theory of Declarative Knowl- 
edge. In Foundations of Deductive Databases and Logic Programming, J. Minker, Ed. Morgan 
Kaufmann Publishers, Inc., Washington DC, 89-148. 



Design and Implementation of Aggregate Functions in the DLV System 



31 



Baral, C. 2003. Knowledge Representation, Reasoning and Declarative Problem Solving. Cam- 
bridge University Press. 

Ben-Eliyahu, R. and Dechter, R. 1994. Prepositional Semantics for Disjunctive Logic Pro- 
grams. Annals of Mathematics and Artificial Intelligence 12, 53-87. 

BUCCAFURRI, p., Leone, N., and Rullo, P. 2000. Enhancing Disjunctive Datalog by Con- 
straints. IEEE Transactions on Knowledge and Data Engineering 12, 5, 845-860. 

Calimeri, F., Faber, W., Leone, N., and Perri, S. 2005. Declarative and Computational 
Properties of Logic Programs with Aggregates. In Nineteenth International Joint Conference on 
Artificial Intelligence (IICAI-05). 406^1 1. 

Chimenti, D., Gamboa, R., Krishnamurthy, R., Naqvi, S. a., Tsur, S., and Zaniolo, C. 
1990. The LDL System Prototype. IEEE Transactions on Knowledge and Data Engineering 2, 1. 

Dell'Armi, T., Faber, W., Ielpa, G., Leone, N., and Pfeifer, G. 2003. Aggregate 
Functions in DLV. In Proceedings ASP03 - Answer Set Programming: Advances in The- 
ory and Implementation, M. de Vos and A. Provetti, Eds. Messina, Italy, 274—288. Online at 
http: //CEUR-WS .o rg/Vol-? 8/ 

DOWLING, W. F. AND Gallier, J. H. 1984. Linear-time Algorithms for Testing the Satisfability 
of Propositional Horn Formulae. Journal of Logic Programming 3, 267-284. 

Liter, T., Faber, W., Leone, N., and Pfeifer, G. 2000. Declarative Problem-Solving Using the 
DLV System. In Logic-Based Artificial Intelligence, J. Minker, Ed. Kluwer Academic Publishers, 
79-103. 

ElTER, T. AND GOTTLOB, G. 1995. On the Computational Cost of Disjunctive Logic Programming: 
Propositional Case. Annals of Mathematics and Artificial Intelligence 15, 3/4, 289-323. 

ElTER, T., GOTTLOB, G., AND Mannila, H. 1997. Disjunctive Datalog. ACM Transactions on 
Database Systems 22, 3 (Sept.), 364-418. 

ElTER, T., GOTTLOB, G., AND Veith, H. 1997. Modular Logic Programming and Generalized 
Quantifiers. In Proceedings of the 4th International Conference on Logic Programming and Non- 
monotonic Reasoning (LPNMR-97), J. Dix, U. Furbach, and A. Nerode, Eds. LNCS, vol. 1265. 
Springer, 290-309. 

Elkabani, L, Pontelli, E., and Son, T. C. 2005. Smodels'* — a system for computing answer 
sets of logic programs. In Logic Programming and Nonmonotonic Reasoning — 8th International 
Conference, LPNMR'05, Diamante, Italy, September 2005, Proceedings, C. Baral, G. Greco, 
N. Leone, and G. Terracina, Eds. Lecture Notes in Computer Science, vol. 3662. Springer Ver- 
lag, 427-431. 

Faber, W. 2002. Enhancing Efficiency and Expressiveness in Answer Set Programming Systems. 
Ph.D. thesis, Institut fiir Informationssysteme, Technische Universitat Wien. 

Faber, W. 2005. Unfounded Sets for Disjunctive Logic Programs with Arbitrary Aggregates. In 
Logic Programming and Nonmonotonic Reasoning — 8th International Conference, LPNMR'05, 
Diamante, Italy, September 2005, Proceedings, C. Baral, G. Greco, N. Leone, and G. Terracina, 
Eds. Lecture Notes in Computer Science, vol. 3662. Springer Verlag, 40-52. 

Faber, W., Leone, N., Mateis, C, and Pfeifer, G. 1999. Using Database Optimization Tech- 
niques for Nonmonotonic Reasoning. In Proceedings of the 7th International Workshop on De- 
ductive Databases and Logic Programming (DDLP'99), INAP Organizing Committee, Ed. Prolog 
Association of Japan, 135-139. 

Faber, W., Leone, N., and Pfeifer, G. 1999. Pushing Goal Derivation in DLP Computations. 
In Proceedings of the 5th International Conference on Logic Programming and Nonmonotonic 
Reasoning (LPNMR'99), M. Gelfond, N. Leone, and G. Pfeifer, Eds. Lecture Notes in AI (LNAI), 
vol. 1730. Springer Verlag, El Paso, Texas, USA, 177-191. 

Faber, W., Leone, N., and Pfeifer, G. 2001. Experimenting with Heuristics for Answer Set 
Programming. In Proceedings of the Seventeenth International Joint Conference on Artificial 
Intelligence (IJCAI) 2001. Morgan Kaufmann Publishers, Seattle, WA, USA, 635-640. 

Faber, W., Leone, N., and Pfeifer, G. 2004. Recursive aggregates in disjunctive logic pro- 



32 



W. Faber et al. 



grams: Semantics and complexity. In Proceedings of the 9th European Conference on Artificial 
Intelligence (JELIA 2004), J. J. Alferes and J. Leite, Eds. Lecture Notes in AI (LNAI), vol. 3229. 
Springer Verlag, 200-212. 

Ferraris, P. 2005. Answer Sets for Propositional Theories. In Logic Programming and Nonmono- 
tonic Reasoning — 8th International Conference, LPNMR'05, Diamante, Italy, September 2005, 
Proceedings, C. Baral, G. Greco, N. Leone, and G. Terracina, Eds. Lecture Notes in Computer 
Science, vol. 3662. Springer Verlag, 119-131. 

Ferraris, P. and Lifschitz, V. 2005. Weight constraints as nested expressions. Theory and 
Practice of Logic Programming 5, 1-2, 45-74. 

Gebser, M., Kaufmann, B., Neumann, A., and Schaub, T. 2007. Conflict-driven answer set 
solving. In Twentieth International Joint Conference on Artificial Intelligence (UCAI-07). Morgan 
Kaufmann Publishers, 386-392. 

Gebser, M., Liu, L., Namasivayam, G., Neumann, A., Schaub, T., and Truszczynski, 
M. 2007. The first answer set progranmiing system competition. In Logic Prograrruning and 
Noiunonotonic Reasoning — 9th International Conference, LPNMR '07, C. Baral, G. Brewka, and 
J. Schlipf, Eds. Lecture Notes in Computer Science, vol. 4483. Springer Verlag, Tempe, Arizona, 
3-17. 

Gelfond, M. 2002. Representing Knowledge in A-Prolog. In Computational Logic. Logic Pro- 
gramming and Beyond, A. C. Kakas and F. Sadri, Eds. LNCS, vol. 2408. Springer. 41 3^51. 

Gelfond, M. and Lifschitz, V. 1991. Classical Negation in Logic Programs and Disjunctive 
Databases. New Generation Computing 9, 365-385. 

GOTTLOB, G., Leone, N., and Veith, H. 1999. Succinctness as a Source of Expression Com- 
plexity. Annals of Pure and Applied Logic 97, 1-3, 231-260. 

Hella, L., Libkin, L., Nurmonen, }., and Wong, L. 2001. Logics with aggregate operators. 
Journal of the ACM 48, 4, 880-907. 

Kemp, D. B. and Ramamohanarao, K. 1998. Efficient Recursive Aggregation and Negation in 
Deductive Databases. IEEE Transactions on Knowledge and Data Engineering 10, 727-745. 

Leone, N., Perri, S., and Scarcello, F. 2001. Improving ASP Instantiators by Join-Ordering 
Methods. In Logic Programming and Nonmonotonic Reasoning — 6th International Conference, 
LPNMR'Ol, Vienna, Austria, T. Eiter, W. Faber, and M. Truszczynski, Eds. Lecture Notes in AI 
(LNAI), vol. 2173. Springer Veriag, 280-294. 

Leone, N., Pfeifer, G., Faber, W., Eiter, T., Gottlob, G., Perri, S., and Scarcello, 
F. 2006. The DLV System for Knowledge Representation and Reasoning. ACM Transactions on 
Computational Logic 7, 3 (July), 499-562. 

Leone, N., Rullo, P., and Scarcello, F. 1997. Disjunctive Stable Models: Unfounded Sets, 
Fixpoint Semantics and Computation. Information and Computation 135, 2 (June), 69-112. 

LlERLER, Y. 2005. Cmodels for Tight Disjunctive Logic Programs. In W(C)LP 19th Workshop on 
(Constraint) Logic Programming, Ulm, Germany. Ulmer Informatik-Berichte. Universitat Ulm, 
Germany, 163-166. 

Liu, L. and Truszczynski, M. 2005. Properties of programs with monotone and convex con- 
straints. In Proceedings of the 20th National Conference on Artificial Intelligence (AAAr05), 
M. M. Veloso and S. Kambhampati, Eds. 701-706. 

Marek, V. W., Niemela, L, and Truszczynski, M. 2004. Logic Programming with Monotone 
Cardinality Atom. In Proceedings of the 7th International Conference on Logic Programming 
and Non-Monotonic Reasoning (LPNMR-7), V. Lifschitz and I. Niemela, Eds. LNAI, vol. 2923. 
Springer, 154-166. 

Marek, V. W. and Remmel, J. B . 2002. On Logic Programs with Cardinality Constraints. In Pro- 
ceedings of the 9th International Workshop on Non-Monotonic Reasoning (NMR'2002), S. Ben- 
ferhat and E. Giunchiglia, Eds. Toulouse, France, 219-228. 

Marek, V. W. and Truszczynski, M. 1991. Autoepistemic Logic. Journal of the ACM 38, 3, 
588-619. 



Design and Implementation of Aggregate Functions in the DLV System 33 



MiNKER, J. 1982. On Indefinite Data Bases and the Closed World Assumption. In Proceedings 
6''' Conference on Automated Deduction (CADE '82), D. W. Loveland, Ed. Lecture Notes in 
Computer Science, vol. 138. Springer, New York, 292-308. 

MiNOUX, M. 1988. LTUR: A Simplified Linear-time Unit Resolution Algorithm for Horn Formulae 
and Computer Implementation. Information Processing Letters 29, 1-12. 

NiEMELA, I., Simons, R, and Soininen, T. 1999. Stable Model Semantics of Weight Constraint 
Rules. In Proceedings of the 5th International Conference on Logic Programming and Nonmono- 
tonic Reasoning (LPNMR'99), M. Gelfond, N. Leone, and G. Pfeifer, Eds. Lecture Notes in AI 
(LNAI), vol. 1730. Springer Verlag, El Paso, Texas, USA, 107-116. 

Pelov, N. 2004. Semantics of Logic Programs with Aggregates. Ph.D. thesis, Katholieke Univer- 
siteit Leuven, Leuven, Belgium. 

Pelov, N., Denecker, M., and Bruynooghe, M. 2004. Partial stable models for logic pro- 
grams with aggregates. In Proceedings of the 7th International Conference on Logic Programming 
and Non-Monotonic Reasoning (LPNMR-7). Lecture Notes in AI (LNAI), vol. 2923. Springer, 
207-219. 

Przymusinski, T. C. 1988. On the Declarative Semantics of Deductive Databases and Logic 

Programs. In Foundations of Deductive Databases and Logic Programming, J. Minker, Ed. Morgan 

Kaufmann Publishers, Inc., 193-216. 
Ross, K. A. AND Sagiv, Y. 1997. Monotonic Aggregation in Deductive Databases. loumal of 

Computer and System Sciences 54, 1 (Feb.), 79-91. 
Simons, P., Niemela, L, and Soininen, T. 2002. Extending and Implementing the Stable Model 

Semantics. Artihcial Intelligence 138, 181-234. 
Son, T. C. and Pontelli, E. 2007. A Constructive Semantic Characterization of Aggregates in 

ASP. Theory and Practice of Logic Programming 7,355-375. 
Son, T. C, Pontelli, E., and Elkabani, I. 2005. On Logic Programming with Aggregates. 

Tech. Rep. NMSU-CS-2005-006, New Mexico State University. 
Syrjanen, T. 2002. Lparse 1.0 User's Manual, http : / /www . tcs . hut . fi /Software/ smodels/lparse . ps . gz[ 
Ullman, J. D. 1989. Principles of Database and Knowledge Base Systems. Computer Science 

Press. 



34 



W. Faber et al. 



Fig. 2. Results for Fastfood on German Motorways A1-A5 and A8 
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